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EG327 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVATAVLA 

BZ19 8 MNKIYRIIWN SALNAWVWS ELTRNHTKRA SATVATAVLA 

B210 MNKISRIIWN SALNAWVWS ELTRNHTKRA SATVATAVLA 

HIS MNKIYRIIWN SALNAWVWS ELTRNHTKRA SATVATAVLA 

EG329 MNEILRIIWN SALNAWVWS ELTRNHTKRA SATVKTAVLA 

PMC21 MNKIYRIIWN SALNAWVWS ELTRNHTKRA SATVKTAVLA 

H38 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA 

P2 0 MNKIYRIIWN SALNAWVWS ELTRNHTKRA SATVATAVLA 

Z24 91 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA 

H41 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA 

Consensus MN-I-RIIWN SALNAWV-VS ELTRNHTKRA SATV-TAVLA 



50 

T L L FAT VQ AS . 

TLLFATVQAN 

TLLFATVQAN 

TLLFATVQAN 

TLLFATVQAS 

TLLFATVQAS 

TLLFATVQAN 

TLLSATVQAN 

TLLFATVQAN 

T L L FAT VQ AN 

TLL-ATVOA- 



Cl 



EG327 
BZ198 
BZ10 

HIS 
EG329 
PMC21 

H38 

P20 
Z2491 

H41 

Consensus 



DL 
DL 
DL 
DL 



51 

TTDDD . 
ATDDD. 
ATDDD , 
ATDDD . 
ANNEEQEEDL 
ANNEEQEEDL 
ATDED . . EEE 
ATDTD. . EDE 
ATDED . . EEE 
ATDED. . EEE 



YLEPVQRTAV 
YLEPVQRTAV 
YLEPVQRTAV 
YLEPVQRTAV 
YLDPVLRTVA 
YLDPVQRTVA 
ELEPWRSAL 
ELESVARSAL 
ELESVQR.SV 
ELESVQR. SV 
-L — V-R 



VLSFRSDKEG TGEKE.VTED 
VLSFRSDKEG TGEKE.GTED 
VLSFRSDKEG TGEKE.GTED 
VLSFRSDKEG TGEKE.GTED 
VLIVNSDKEG TGEKEKVEEN 
VLIVNSDKEG TGEKEKVEEN 
VLQFMIDKEG NGENE.STGN 
VLQFMIDKEG NGEIESTGDI 
VGSIQASMEG SGELET ... I 
VGSIQASMEG SVELET. . .1 
V EG — E-E 



100 

SNWGVYFDKK 
SNWAVYFDEK 
SNWAVYFDEK 
SNWAVYFDEK 
SDWAVYFNEK 
SDWAVYFNEK 
IGWSIYYDNH 
GWSIYYDDHN 
SLSMTNDSKE 
SLSMTNDSKE 



VI 
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EG327 


GVLTAGTITL 


KAGDNLKIKQ 


BZ198 


RVLKAGAITL 


KAGDNLKIKQ 


BZ10 


RVLKAGAI TL 


KAGDNLKIKQ 


HIS 


RVLKAGAITL 


KAGDNLKIKQ 


EG329 


GVLT ARE I T L 


KAGDNLKIKQ 


PMC21 


GVLTAREITL 


KAGDNLKIKQ 


H38 


NTLHGATVTL 


KAGDNLKIKQ 


P20 


TLHG.ATVTL 


KAGDNLKIKQ 


Z2491 


FVDPYIWTL 


KAGDNLKIKQ 


H41 


FVDPYIWTL 


KAGDNLKIKQ 


Consensus 


TL 


KAGDNLKIKO 




VI 


C2 



NTNENTNASS 
NTNENTNDSS 
NTNENTNENT 
NTNENTNENT 
NG. . .TN. . . 
NG. . .TN. . . 
NTNKNTNENT 

SGKD 

NTNENTNASS 
NTNENTNASS 

V2 



FTYSLK 

.... FTYSLK 
NDSSFTYSLK 
NDSSFTYSLK 
.... FTYSLK 

FTYSLK 

NDSSFTYSLK 
.... FTYSLK 
.... FTYSLK 
.... FTYSLK 
FTYSLK 



150 

KDLTDLT SVG 
KDLTDLTSVE 
KDLTDLT SVE 
KDLTDLTSVE 
KDLTDLT SVG 
KDLTDLTSVG 
KDLTDLTSVE 
KELKDLTSVE 
KDLTGLINVE 
KDLTGLINVE 
K-L — L — V- 
C3 



FIG. 1 



2/31 



151 200 

EG327 TEKLSFSANS NKVNITSDTK GLNFAKKTAE TNGDTTVHLN GIGSTLTDTL 

BZ198 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDPTVHLN GIGSTLTDTL 

BZ10 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDPTVHLN GIGSTLTDTL 

H15 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDPTVHLN GIGSTLTDTL 

EG329 TEKLS FSANG NKVNITSDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDTL 

PMC21 TEKLS FSANG NKVNITSDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDTL 

H38 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDTL 

P20 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDPTVHLN GIGSTLTDTL 

Z2491 TEKLSFGANG KKVNIISDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDTL 

H41 TEKLSFGANG KKVNIISDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDML 

Consensus TEKLS F- AN KVNI-SDTK GLNFAK-TA- TNGD-TVH LN GIGSTI.TD-T. 

C3 ' 



201 

EG327 LNT GATTNVT NDNVTDDEKK RAASVKDVLN AGWNIKGVKP GTTAS DNV 
BZ198 LNTGATTNVT NDNVTDDEKK RAASVKDVLN AGWNIKGVKP GTTAs"Sw 
BZ10 LNTGATTNVT NDNVTDDEKK RAASVKDVLN AGWNIKGVKP GTTAS DNV 
H15 LNTGATTNVT NDNVTDDEKK RAASVKDVLN AGWNIKGVKP GTTAS ' DNV 
EG329 LNTGATTNVT NDNVTDDEKK RAASVKDVLN AGWNIKGVKP GTTAS " DNV 
PMC21 LNTGATTNVT NDNVTDDEKK RAASVKDVLN AGWNIKGVKP GTTAS " DNV 
H38 LNTGATTNVT NDNVTDDKKK RAASVKDVLN AGWNIKGVKP GTTAS " DNV 
AGSSASHVDA GNQST..HYT RAASIKDVLN AGWNIKGVKT GSTTGQSENV 
Z2491 AGSSASHVDA GNQST..HYT RAASIKDVLN AGWNIKGVKT GSTTGQSENV 
H41 LNTGATTNVT NDNVTDDEKK RAASVKDVLN AGWNIKGVKP GTTAS DNV 

Consensus A T RAAS-KDVT.N AGWNIKGVK- g-T --NV 

V3 _ C4 V4 



EG327 DFVRTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT SVIKEKDGKL 

BZ198 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT SVIKEKDGKL 

BZ10 DFVRTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT SVIKEKDGKL 

H15 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT SVIKEKDGKL 

EG329 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT SVIKEKDGKL 

PMC21 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT SVIKEKDGKL 

H38 DFVHTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT SVIKEKDGKL 

P20 DFVRTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT SVIKEKDGKL 

Z2491 DFVRTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT SVIKEKDGKL 

H41 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT SVIKEKDGKL 

Consensus DFV-TYDT VE FLSADTKTTT VNVESKDNGK -TEVKIGAKT SVIKEKDGKL 

C5 
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301 350 

EG327 VTGKDKGEND SSTDKGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

BZ198 VTGKGKDENG SSTDEGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

BZ10 VTGKGKGENG SSTDEGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

H15 VTGKGKDENG SSTDEGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

EG32 9 VTGKDKGENG SSTDEGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

PMC21 VTGKDKGENG SSTDEGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

H3 8 VTGKGKGENG SSTDEGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

P20 VTGKGKGENG SSTDEGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

Z2 4 91 VTGKGKGENG SSTDEGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

H41 VTGKGKGENG SSTDEGEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 

Consensus VTGK-K-EN- SSTD-GEGLV TAKEVIDAW KAGWRMKTTT ANGQTGQADK 



C5 



EG327 
BZ198 
BZ10 

HIS 
EG32 9 
PMC21 

H38 

P20 
Z2491 

H41 

Consensus 



351 

FETVTSGTNV 
FETVTSGTNV 
FETVTSGTKV 
FETVTSGTKV 
FETVTSGTNV 
FETVTSGTNV 
FETVTSGTNV 
FETVTSGTKV 
FETVTSGTNV 
FETVTSGTKV 
FETVTSGT-V 



TFASGKGTTA 
TFASGKGTTA 
TFASGNGTTA 
T FAS GNGTTA' 
TFASGKGTTA 
TFASGKGTTA 
TFASGKGTTA 
TFASGNGTTA 
TFASGKGTTA 
TFASGNGTTA 
TFASG-GTTA 



TVSKDDQGNI 
TVSKDDQGNI 
TVSKDDQGNI 
TVSKDDQGNI 
TVSKDDQGNI 
TVSKDDQGNI 
TVSKDDQGNI 
TVSKDDQGNI 
TVSKDDQGNI 
TVSKDDQGNI 
TVSKDDQGNI 



TVMYDVNVGD 
TVKYDVNVGD 
TVKYDWVGD 
TVKYDVNVGD 
TVMYDVNVGD 
TVMYDVNVGD 
TVKYDVNVGD 
TVKYDVNVGD 
TVMYDVNVGD 
TVKYDVNVGD 
TV-YDVNVGD 



400 

ALNWQLQNS 

ALNWQLQNS 

ALNWQLQNS 

ALNWQLQNS 

ALNWQLQNS 

ALNWQLQNS 

ALNWQLQNS 

ALNWQLQNS 

ALNWQLQNS— 

ALNWQLQNS 

ALNWQLQNS 



C5 



EG327 
BZ198 
BZ10 

H15 
EG329 
PMC 21 

H38 

P20 
Z2491 

H41 

Consensus 



401 

GWNLDSKAVA 
GWNLDSKAVA 
GWNLDSKAVA 
GWNLDSKAVA 
GWNLDSKAVA 
GWNLDSKAVA 
GWNLDSKAVA 
GWNLDSKAVA 
GWNLDSKAVA 
GWNLDSKAVA 
GWNLDSKAVA 



GSSGKVISGN 
GSSGKVISGN 
GSSGKVISGN 
GSSGKVISGN 
GSSGKVISGN 
GSSGKVISGN 
GSSGKVISGN 
GSSGKVISGN 
GSSGKVISGN 
GSSGKVISGN 
GSSGKVISGN 



VSPSKGKMDE 
VSPSKGKMDE 
VSPSKGKMDE 
VSPSKGKMDE 
VSPSKGKMDE 
VSPSKGKMDE 
VSPSKGKMDE 
VSPSKGKMDE 
VSPSKGKMDE 
VSPSKGKMDE 
VSPSKGKMDE 



TWINAGNNI 
TWINAGNNI 
TWINAGNNI 
TWINAGNNI 
TWINAGNNI 
TWINAGNNI 
TWINAGNNI 
TWINAGNNI 
TWINAGNNI 
TWINAGNNI 
TWINAGNNI 



450 

EITRNGKNID 
EITRNGKNID 
EITRNGKNID 
EITRNGKNID 
EITRNGKNID 
EITRNGKNID 
EITRNGKNID 
EITRNGKNID 
EISRNGKNID 
EITRNGKNID 
EI-RNGKNID 
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451 

EG32 7 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGS KDANK 

BZ198 IATSMAPQFS SVSLGAGADA PTLSVDDEGA LNVGS KDTNK 

BZ10 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGS KDANK 

H15 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK 

EG329 IATSMTPQFS SVSLGAGADA PTLSVDG . DA LNVGS KKDNK 

PMC21 IATSMTPQFS SVSLGAGADA PTLSVDG. DA LNVGS KKDNK 

H38 IATSMTPQFS SVSLGAGADA PTLSVDDKGA LNVGSKDANK 

P2 0 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK 

Z24 91 IATSMAPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK 

H41 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK 

Consensus IATSM-POFS SVSLGAGADA PTLSVD A LNVGS K — NK 



500 

PVRITNVAPG 
PVRITNVAPG 
PVRITNVAPG 
PVRITNVAPG 
PVRITNVAPG 
PVRITNVAPG 
PVRITNVAPG 
PVRITNVAPG 
PVRITNVAPG 
PVRITNVAPG 
PVRITNVAPG 



C5 



501 550 

EG327 VKEGDVTNVA QLKGVAQNLN NHIDNVDGNA RAGIAQAIAT AGLVQAYLPG 

BZ198 VKEGDVTNVA QLKGVAQNLN NRIDNVDGNA RAGIAQAIAT AGLVQAYLPG 

BZ10 VKEGDVTNVA QLKGVAQNLN NRIDNVDGNA RAGIAQAIAT AGLAQAYLPG 

H15 VKEGDVTNVA QLKGVAQNLN NRIDNVDGNA RAGIAQAIAT AGLAQAYLPG 

EG32 9 VKEGDVTNVA QLKGVAQNLN NRIDNVDGNA RAGIAQAIAT AGLVQAYLPG 

PMC21 VKEGDVTNVA QLKGVAQNLN NRIDNVDGNA RAGIAQAIAT AGLVQAYLPG 

H38 VKEGDVTNVA QLKGVAQNLN NRIDNVDGNA RAGIAQAIAT AGLVQAYLPG 

P2 0 VKEGDVTNVA QLKGVAQNLN NRIDNVNGNA RAGIAQAIAT AGLAQAYLPG 

Z2491 VKEGDVTNVA QLKGVAQNLN NRIDNVDGNA RAGIAQAIAT AGLVQAYLPG 

H41 VKEGDVTNVA QLKGVAQNLN NRIDNVNGNA RAGIAQAIAT AGLVQAYLPG 

Consensus VKEGDVTNVA QLKGVAQNLN N-IDN V-GNA RA GIAQAIAT AGL-QAYLPG 

C5 



EG327 
BZ198 
BZ10 

H15 
EG329 
PMC21 

H38 

P20 
Z2491 

H41 

Consensus 



551 

KSMMAIGGGT 
KSMMAIGGDT 
KSMMAIGGGT 
KSMMAIGGGT 
KSMMAIGGGT 
KSMMAIGGGT 
KSMMAIGGGT 
KSMMAIGGGT 
KSMMAIGGGT 
KSMMAIGGGT 
KSMMAIGG-T 



YRGEAGYAIG 
YRGEAGYAIG 
YRGEAGYAIG 
YRGEAGYAIG 
YRGEAGYAIG 
YRGEAGYAIG 
YRGEAGYAIG 
YLGEAGYAI G 
YRGEAGYAIG 
YLGEAGYAI G 
Y-GEAGYAIG 



YSSISDGGNW IIKGTASGNS 
YSSISDGGNW IIKGTASGNS 
YSSISDTGNW VIKGTASGNS 
YSSISDTGNW VIKGTASGNS 
YSSISDGGNW IIKGTASGNS 
YSSISDGGNW IIKGTASGNS 
YSSISDGGNW IIKGTASGNS 
YSSISDTGNW VIKGTASGNS 
YSSISDGGNW IIKGTASGNS 
YSSISAGGNW IIKGTASGNS 
Y5SIS — GNW -IKGTASGNS 



600 

RGHFGASASV 
RGHFGASASV 
RGHFGT SAS V 
RGHFGASASV 
RGHFGASASV 
RGHFGASASV 
RGHFGASASV 
RGHFGTSASV 
RGHFGASASV 
RGHFGASASV 
RGHFG- SAS V 
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601 

EG327 GYQW. 

B2198 GYQW, 

BZ10 GYQW. 

HIS GYQW, 

EG329 GYQW. 

PMC21 GYQW. 

H3 8 GYQW. 

P20 GYQW. 

Z2491 GYQW. 

H4 1 GYQW . 

Consensus GYQW. 



C5 
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HI 5 ATGAACAAAA 
BZ10 ATGAACAAAA 
BZ198 ATGAACAAAA 
P2 0 ATGAACAAAA 
H38 ATGAACAAAA 
Z2491 ATGAACAAAA 
H4 1 ATGAACAAAA 
AT GAAC GAAA 
ATGAACAAAA 
ATGAACAAAA 
ATGAAC-AAA 



EG329 
PMC21 
EG327 
Consensus 



TAT AC C GC AT 
TATCCCGCAT 
TATACCGCAT 
TATACCGCAT 
TATACCGCAT 
TATACCGCAT 
TATACCGCAT 
TAT T GC GC AT 
TATACCGCAT 
TATACCGCAT 
TAT — CGCAT 



CAT T T GGAAT 
CAT TT GGAAT 
C AT TT GGAAT 
C AT TT GGAAT 
CATTT GGAAT 
CATTT GGAAT 
CATTT GGAAT 
CATTT GGAAT 
CATTT GGAAT 
C AT TT GGAAT 
CAT T T GGAAT 

CI 



AGTGCCCTCA 
AGTGCCCTCA 
AGTGCCCTCA 
AGTGCCCTCA 
AGTGCCCTCA 
AGTGCCCTCA 
AGTGCCCTCA 
AGCGCCCTCA 
AGTGCCCTCA 
AGTGCCCTCA 
AG-GCCCTCA 



ATGCCTGGGT 
ATGCCTGGGT 
ATGCCTGGGT 
ATGCCTGGGT 
ATGCCTGGGT 
ATGCCTGGGT 
ATGCCTGGGT 
ATGCCTGGGT 
ATGCATGGGT 
ATGCCTGGGT 
ATGC-TGGGT 



CGTCGTATCC 
CGTCGTATCC 
CGTCGTATCC 
AGTCGTATCC 
CGCCGTATCC 
CGCCGTATCC 
CGCCGTATCC 
CGTTGTATCC 
CGTCGTATCC 
CGCCGTATCC 
-G — GTATCC 



GAGCT CACAC 
GAGCTCACAC 
GAGCT CACAC 
GAGC T CACAC 
GAGCTCACAC 
GAGCTCACAC 
GAGCTCACAC 
GAGCTCACAC 
GAGCTCACAC 
GAGCTCACAC 
GAGCTCACAC 



HIS 
BZ10 
BZ198 

P20 

H38 
Z2491 

H41 
EG32 9 
PMC21 
EG327 
Consensus 



71 

GCAACCACAC 
GCAACCACAC 
GCAACCACAC 
GCAACCACAC 
GCAACCACAC 
GCAACCACAC 
GCAACCACAC 
GCAACCACAC 
GCAACCACAC 
GCAACCACAC 
GCAACCACAC 



CAAACGCGCC 
CAAACGCGCC 
CAAACGCGCC 
CAAACGCGCC 
CAAACGCGCC 
CAAACGCGCC 
CAAACGCGCC 
CAAACGCGCC 
CAAACGCGCC 
CAAACGCGCC 
CAAACGCGCC 



TCCGCAACCG 
TCCGCAACCG 
TCCGCAACCG 
TCCGCAACCG 
TCCGCAACCG 
TCCGCAACCG 
TCCGCAACCG 
TCCGCAACCG 
TCCGCAACCG 
TCCGCAACCG 
TCCGCAACCG 



TGGCGACCGC 
TGGCGACCGC 
TGGCGACCGC 
TGGCGACCGC 
TGAAGACCGC 
TGAAGACCGC 
TGAAGACCGC 
TGAAGACCGC 
TGAAGACCGC 
TGGCGACCGC 
TG — GACCGC 



CGTATTGGCG 
CGTATTGGCG 
CGTATTGGCG 
CGTATTGGCG 
CGTATTGGCG 
CGTATTGGCG 
CGTATTGGCG 
CGTATTGGCG 
CGTATTGGCG 
CGTATTGGCG 
CGTATTGGCG 



ACACTGTTGT 
ACACTGTTGT 
ACACTGTTGT 
ACACTGCTGT 
ACGCTGTTGT 
ACACTGTTGT 
ACACTGTTGT 
ACTCTGTTGT 
ACTCTGTTGT 
ACACTGTTGT 
AC-CTG-TGT 



140 

TTGCAACGGT 
TTGCAACGGT 
TTGCAACGGT 
CCGCAACGGT 
TTGCAACGGT 
TTGCAACGGT 
T TGCAAC GGT 
TTGCAACGGT 
TTGCAACGGT 
TTGCAACGGT 
— GCAACGGT 



CI 





141 












210 


H15 


TCAGGCGAAT 


GCTACCGATG 


ACGAC 




TATTTAGAAC 


CCGTACAACG 


CACTGCTGTC 


BZ10 


TCAGGCGAAT 


GC T ACC GAT G 


ACGAC 




TATTTAGAAC 


CCGTACAACG 


CACTGCTGTC 


BZ198 


T CAGGCGAAT 


GCTACCGATG 


ACGAC 


GATTTA 


TATTTAGAAC 


CCGTACAACG 


CACTGCTGTC 


P20 


TCAGGCGAAT 


GCTACCGATA 




. GAAGATGAA 


GAGTTAGAAT 


CCGTAGCACG 


CTCTGCTCTG 


H38 


TCAGGCGAAT 


GCTACCGATG 


AAGAT 


. GAAGAAGAA 


GAGTTAGAAC 


CCGTAGTACG 


CTCTGCTCTG 


Z2491 


TCAGGCGAAT 


GCTACCGATG 


AAGAT 


. GAAGAAGAA 


GAGTTAGAAT 


CCGTACAACG 


CTCTGTCGTA 


H41 


TCAGGCGAAT 


GCTACCGATG 


AAGAT 


. GAAGAAGAA 


GAGTTAGAAT 


CCGTACAACG 


CTCTG. . .TC 


EG32 9 


TCAGGCAAGT 


GCTAACAATG 


AAGAGCAAGA AGAAGATTTA 


TATTTAGACC 


CCGTGCTACG 


CACTGTTGCC 


PMC 2 1 


TCAGGCAAGT 


GC TAAC AATG 


AAGAGCAAGA AGAAGATTTA 


TATTTAGACC 


CCGTACAACG 


CACTGTTGCC 


EG327 


TCAGGCGAGT 


AC T ACC GAT G 


ACGAC 


GATTTA 


TATTTAGAAC 


CCGTACAACG 


CACTGCTGTC 


Consensus 


TCAGGC-A-T 


-CTA-C-AT- 


--GA 


GA A 


-A-TTAGA — 


CCGT ACG 


C-CTG 




CI 






VI 
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HI 5 
BZ10 
BZ198 

P2 0 

H38 
22491 

H41 
EG329 
PMC 21 
EG327 
Consensus 



211 

GTGTTGAGCT 
GTGTTGAGCT 
GTGTTGAGCT 
GTGTTGCAAT 
GTGTTGCAAT 
GGG. .AGCAT 
GTAGGGAGCA 
GTGTTGATAG 
GT GT T GAT AG 
GTGTTGAGCT 



TCCGTTCCGA 
TCCGTTCCGA 
TCCGTTCCGA 
TCATGATCGA 
T CAT GAT C GA 
TCAAG.CCAG 
TTCAAGCCAG 
TCAATTCCGA 
TCAATTCCGA 
TCCGTTCCGA 
T C— 



TAAAGAAGGC 
TAAAGAAGGC 
TAAAGAAGGC 
TAAAGAAGGC 
TAAAGAAGGC 
TAT GGAAGGC 
TAT GGAAGGC 
TAAAGAAGGC 
TAAAGAAGGC 
TAAAGAAGGC 
TA — GAAGGC 



ACGGGAGAAA 
ACGGGAGAAA 
ACGGGAGAAA 
AATGGAGAAA 
AATGGAGAAA 
AGCGGCGAAT 
AGCGTCGAAT 
ACGGGAGAAA 
ACGGGAGAAA 
ACGGGAGAAA 
A--G — GAA- 
VI 



AAGAAGGTAC 
AAGAAGGTAC 
AAGAAGGTAC 
TCGAATCTAC 
ACGAATCTAC 
T GGAAAC GAT 
TGGAAACGAT 
AAGAAAAAGT 
AAGAAAAAGT 
AAGAAGT T AC 
— GAA 



AGAAGA. 
AGAAGA. 
AGAAGA. 
AGGAGA. 
AGGAAA. 
AT CAT . . 

A 

AGAAGAAAAT 
AGAAGAAAAT 
AGAAGA. . . T 



. T 
. T 
. T 
. T 
.T 
. T 



280 

TCAAATTGGG 
TCAAATTGGG 
TCAAATTGGG 
ATAGGTT GGA 
ATAGGTTGGA 
ATCAATGACT 
T CAT TAT C AA 
TCAGATTGGG 
TCAGATTGGG 
TCAAATTGGG 



HIS 
BZ10 
BZ198 

P20 

H38 
Z2491 

H41 
EG329 
PMC21 
EG327 
Consensus 



281 

CAGTATATTT 
CAGTATATTT 
CAGTATATTT 
GTATATATTA 
GTATATATTA 
AACGACAGCA 
TGACTAACGA 
CAGTATATTT 
CAGTATATTT 
GAGTATATTT 
A 



CGACGAGAAA 
CGACGAGAAA 
CGACGAGAAA 
CGACGATCAC 
CGACAATCAC 
AGGAATTTGT 
CAGCAAGGAA 
CAACGAGAAA 
CAACGAGAAA 
CGACAAGAAA 



AGAGT AC TAA 
AGAGTACTAA 
AGAGT AC TAA 
AACACTCTAC 
AACACTCTAC 
AGACC CATAC 
TTTGTAGACC 
GGAGTACTAA 
GGAGTACTAA 
GGAGTACTAA 



AAGCCGGAGC 
AAGCCGGAGC 
AAGCCGGAGC 
ACGGCGCAAC 
ACGGCGCAAC 

ATAGTA 

CATACATAGT 
CAGCCAGAGA 
CAGCCAGAGA 
CAGCCGGAAC 



AATCACCCTC 
AATCACCCTC 
AATCACCCTC 
CGTTACCCTC 
CGTTACCCTC 

. GTTACCCTC 
AGTTACCCTC 
AATCACCCTC 
AATCACCCTC 
AATCACCCTC 

--T-ACCCTC 



AAAGC C GGCG 
AAAGCCGGCG 
AAAGC C GGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 



350 

ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 



VI 



C2 



HIS 
BZ10 
BZ198 

P20 

H38 
Z2491 

H41 
EG329 
PMC21 
EG327 
Consensus 



351 

AATCAAACAA 
AATCAAACAA 
AATCAAACAA 
AATCAAACAA 
AATCAAACAA 
AATCAAACAA 
AATCAAACAA 
AATCAAACAA 
AATCAAACAA 
AATCAAACAA 
AATCAAACAA 



C2 



420 

AACACCAATG AAAACACCAA TGAAAACACC AATGACAGTA GCTTCACCTA CTC CCTGAAA 
AACACCAATG AAAACACCAA TGAAAACACC AATGACAGTA GCTTCACCTA CTCCCTGAAA 

AACACCAATG AAAACACC AATGACAGTA GCTTCACCTA CTCCCTGAAA 

AGCGGCAAAG A CTTCACCTA CTCGCTGAAA 

AACACCAATA AAAACACCAA TGAAAACACC AATGACAGTA GCTTCACCTA CTCGCTGAAA 

AACACCAATG AAAACACC AATGCCAGTA GCTTCACCTA CTCGCTGAAA 

AACACCAATG AAAACACC AATGCCAGTA GCTTCACCTA CTCGCTGAAA 

AAC G GCACAA ACTTCACCTA CTCGCTGAAA 

AAC G ... .GCACAA ACTTCACCTA CTCGCTGAAA 

AACACCAATG AAAACACC AATGCCAGTA GCTTCACCTA CTCGCTGAAA 

A _ c C TTCACCTA CTC-CTGAAA 

~ Y2 C3 
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HIS 
BZ10 
BZ198 

P2 0 

H38 
22491 

H41 
EG32 9 
PMC21 
EG327 
Consensus 



421 

AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGAGC TGA 
AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGA-CT-A 



CAGATC TGAC 
CAGATC TGAC 
CAGATC TGAC 
AAGACC TGAC 
CAGATC TGAC 
CAGGCCTGAT 
CAGGCCTGAT 
CAGATC TGAC 
CAGATC TGAC 
CAGATCTGAC 
-AG — CTGA- 



CAGTGTTGAA 
CAGTGTTGAA 
CAGTGTTGAA 
CAGTGTTGAA 
CAGTGTTGAA 
CAATGTTGAA 
CAATGTTGAA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CA-TGTTG-A 



AC TGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 



TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTAG 
TATCGTTTAG 
TATCGTTTAG 
TATCGTTT-G 



C3 



CGCAAACGGT 
CGCAAACGGT 
CGCAAACGGT 
CGCAAACGGT 
CGCAAACGGC 
CGCAAACGGC 
CGCAAACGGC 
CGCAAACGGC 
CGCAAACGGC 
CGCAAACAGC 
CGCAAAC-G- 



450 

AATAAAGTCA 
AATAAAGTCA 
AATAAAGTCA 
AATAAAGTCA 
AATAAAGTCA 
A-AGAAAGTCA 
AAGAAAGTCA 
AATAAAGTCA 
AATAAAGTCA. 
AATAAAGTCA 
AA-AAAGTCA 



HIS 
BZ10 
BZ198 

P2 0 

H38 
Z2491 

H41 
EG32 9 
PMC21 
EG327 
Consensus 



491 

ACATCACAAG 
ACATCACAAG 
ACATCACAAG 
ACATCACAAG 
ACATCACAAG 
ACATCATAAG 
ACATCATAAG 
ACATCACAAG 
ACATCACAAG 
ACATCACAAG 
ACATCA-AAG 



CGAC AC CAAA 
CGACACCAAA 
CGAC AC CAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 



GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 



TTGC GAAAGA 
TTGCGAAAGA 
TTGC GAAAGA 
TTGCGAAAGA 
TCGC GAAAGA 
TCGC GAAAGA 
TCGC GAAAGA 
TTGCGAAAGA 
TTGCGAAAGA 
TCGCGAAAAA 
T-GCGAAA-A 



AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGAG 
AACGGCTG-G 



C3 



AC GAACGGCG 
ACGAACGGCG 
AC GAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
AC-AACGGCG 



560 

A.CCCCACGGT 
ACCCCACGGT 
A.CCCCACGGT 
A.CCCCACGGT 
A.CACCAC GGT 
ACACCACGGT 
AXACCACGGT 
ACACCACGGT 
-ACACCACGGT 
-ACACCACGGT 
AC -CC AC GGT 



HIS 
BZ10 
BZ198 

P20 

H38 
Z2491 

H41 
EG329 
PMC21 
EG327 
Consensus 



561 

TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 



GGTATC GGTT 
GGTATC GGTT 
GGTATCGGTT 
GGT AT C GGTT 
GGTATT GGT T 
GGTATCGGTT 
GGTATCGGTT 
GGTAT T GGTT 
GGTATTGGTT 
GGTATCGGTT 
GGTAT -GGTT 



CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 



CGATACGCTG 
CGATACGCTG 
CGATACGCTG 
CGATACGCTT 
CGATACGCTG 
CGATACGCTT 
CGATATGCTG 
CGATACGCTG 
CGATACGCTG 
CGATACGCTG 
C GAT A - GC T — 



C3 



CTGAATACCG 
CTGAATACCG 
CTGAATACCG 
GCGGGTTCTT 
CTGAATACCG 
GCGGGTTCTT 
CTGAATACCG 
CTGAATACCG 
CTGAATACCG 
CTGAATACCG 
— G — T-C — 



GAGC GACCAC 
GAGCGACCAC 
GAGC GACCAC 
CTGCTTCTCA 
GAGCGACCAC 
CTGCTTCTCA 
GAGCGACCAC 
GAGCGACCAC 
GAGCGACCAC 
GAGCGACCAC 



630 

AAACGTAACC 
AAACGTAACC 
AAACGTAACC 
CGTTGATGCG 
AAACGTAACC 
CGTTGATGCG 
AAACGTAACC 
AAACGTAACC 
AAACGTAACC 
AAACGTAACC 
G C- 



V3 
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HI 5 
B210 
BZ198 

P20 

H38 
Z2491 

H41 
EG329 
PMC21 
EG32 7 
Consensus 



631 

AACGACAACG 
AACGACAACG 
AACGACAACG 
GGTAACCAAA 
AACGACAACG 
GGTAACCAAA 
AACGACAACG 
AACGACAACG 
AACGACAACG 
AACGACAACG 
AC-A— 



TTACCGATGA 
TTACCGATGA 
TTACCGATGA 
GTACACATTA 
TTACCGATGA 
GTACACATTA 
TTACCGATGA 
TTACCGATGA 
TTACCGATGA 
TTACCGATGA 
— TAC — AT -A 



CGAGAAAAAA 
CGAGAAAAAA 
CGAGAAAAAA 

C ACT 

CAAGAAAAAA 

C ACT 

CGAGAAAAAA 
CGAGAAAAAA 
CGAGAAAAAA 
CGAGAAAAAA 
C A— 



CGTGCGGCAA 
CGTGCGGCAA 
CGTGCGGCAA 
C GTGCAGC AA 
CGTGCGGCAA 
CGTGCAGCAA 
CGTGCGGCAA 
CGTGCGGCAA 
CGTGCGGCAA 
CGTGCGGCAA 
CGTGC-GCAA 



GCGTTAAAGA 
GCGTTAAAGA 
GCGTTAAAGA 
GTATTAAGGA 
GCGTTAAAGA 
GTATTAAGGA 
GCGTTAAAGA 
GCGTTAAAGA 
GCGTTAAAGA 
GCGTTAAAGA 
G — TTAA-GA 



CGTATTAAAC 
CGTATTAAAC 
CGTATTAAAC 
TGTGTTGAAT 
CGTATTAAAC 
TGTGTTGAAT 
CGTATTAAAC 
CGTATTAAAC 
CGTATTAAAC 
CGTATTAAAC 
-GT-TT-AA- 



700 

GCAGGCTGGA 
GCAGGCTGGA 
GCAGGCTGGA 
GCGGGTTGGA 
GCAGGCTGGA 
GCGGGTTGGA 
GCAGGCTGGA 
GCTGGCTGGA 
GCTGGCTGGA 
GCAGGCTGGA 
GC-GG-TGGA 



V3 



C4 



701 

HI 5 ACATTAAAGG CGTTAAACCC 

BZ10 ACATTAAAGG CGTTAAACCC 

BZ198 ACATTAAAGG CGTTAAACCC 

P2 0 ATATTAAGGG TGTTAAAACT 

H38 ACATTAAAGG CGTTAAACCC 

Z2491 ATATTAAGGG TGTTAAAACT 

H41 ACATTAAAGG CGTTAAACCC 

EG329 ACATTAAAGG CGTTAAACCC 

PMC21 ACATTAAAGG CGTTAAACCC 

EG327 ACATTAAAGG CGTTAAACCC 

Consensus A-ATTAA-GG -GTTAAA-C- 
C4 



770 

GGTACAACAG CT TC CGATAACGTT GATTTCGTCC GCAC TTACGA 

GGTACAACAG CT TC CGATAACGTC GATTTCGTCC GCACTTACGA 

GGTACAACAG CT TC CGATAACGTT GATTTCGTCC GCACTTACGA 

GGCTCAACAA CTGGTCAATC AGAAAATGTC GATTTCGTCC GCACTTACGA 

GGTACAACAG CT TC CGATAACGTT GATTTCGTCC ACACTTACGA 

GGCTCAACAA CTGGTCAATC AGAAAATGTC GATTTCGTCC GCACTTACGA 

GGTACAACAG CT TC CGATAACGTT GATTTCGTCC GCACTTACGA 

GGTACAACAG CT TC CGATAACGTT GATTTCGTCC GCACTTACGA 

GGTACAACAG CT TC CGATAACGTT GATTTCGTCC GCACTTACGA 

GGTACAACAG CT TC CGATAACGTT GATTTCGTCC GCACTTACGA 

GG — CAACA- CT TC -GA~AA-GT- GATTTC GTCC -C AC TTACGA 



V4 



C5 



H15 
BZ10 
BZ198 

P20 

H38 
22491 

H41 
EG329 
PMC 21 
EG327 
Cons ensus 



771 

CACAGTCGAG 
CACAGTCGAG 
CACAGTCGAG 
CACAGTCGAG 
CACAGTCGAG 
CACAGTCGAG 
CACAGTCGAG 
CACAGTCGAG 
CACAGTCGAG 
CACAGTCGAG 
CACAGTCGAG 



TTCTTGAGCG 
TTCTTGAGCG 
TTCTTGAGCG 
TTCTTGAGCG 
TTCTTGAGCG 
TTCTTGAGCG 
TTCTTGAGCG 
TTCTTGAGCG 
TTCTTGAGCG 
TTCTTGAGCG 
TTCTTGAGCG 



CAGATACGAA 
CAGATACGAA 
CAGATACGAA 
CAGATACGAA 
CAGATACGAA 
CAGATACGAA 
CAGATACGAA 
CAGATACGAA 
CAGATACGAA 
CAGATACGAA 
CAGATACGAA 



AACAACGACT 
AACAAC GACT 
AACAACGACT 
AACAACGACT 
AACAACGACT 
AACAACGACT 
AACAACGACT 
AACAACGACT 
AACAACGACT 
AACAACGACT 
AACAACGACT 



GTTAATGTGG 
GTTAATGTGG 
GTTAATGTGG 
GTTAATGTGG 
GTTAATGTGG 
GTTAATGTGG 
GTTAATGTGG 
GTTAATGTGG 
GTTAATGTGG 
GTTAATGTGG 
GTTAATGTGG 



AAAGCAAAGA 
AAAGCAAAGA 
AAAGCAAAGA 
AAAGCAAAGA 
AAAGCAAAGA 
AAAGCAAAGA 
AAAGCAAAGA 
AAAGCAAAGA 
AAAGCAAAGA 
AAAGCAAAGA 
AAAGCAAAGA 



840 

CAACGGCAAG 
CAACGGCAAG 
CAACGGCAAG 
CAACGGCAAG 
CAACGGCAAG 
CAACGGCAAG 
CAACGGCAAG 
CAACGGCAAG 
CAACGGCAAG 
CAACGGCAAG 
CAACGGCAAG 



C5 



HIS 
BZ10 
B2198 

P20 

H38 
Z2491 

H41 
EG329 
PMC21 
EG327 
Consensus 



841 

AAAACCGAAG 
AGAAC CGAAG 
AAAACCGAAG 
AGAACCGAAG 
AGAACCGAAG 
AGAACCGAAG 
AAAACCGAAG 
AAAACCGAAG 
AAAACCGAAG 
AGAACCGAAG 
A-AACCGAAG 



TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 



TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 



TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATCA 
T CTGTTAT - A 



AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 



CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 



910 

GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 



C5 
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911 






HIS 


AAGGCAAAGA 


CGAGAATGGT 


TCTTC TACAG 


BZ10 


AAGGCAAAGG 


CGAGAATGGT 


TCTTCTACAG 


BZ198 


AAGGCAAAGA 


CGAGAATGGT 


TCTTCTACAG 


P20 


AAGGCAAAGG 


CGAGAATGGT 


TCTTCTACAG 


H38 


AAGGCAAAGG 


CGAGAATGGT 


TCTTCTACAG 


Z2491 


AAGGCAAAGG 


CGAGAATGGT 


TCTTCTACAG 


H41 


AAGGCAAAGG 


CGAGAATGGT 


TCTTCTACAG 


EG329 


AAGACAAAGG 


CGAGAATGGT 


TCTTCTACAG 


PMC 2 1 


AAGACAAAGG 


CGAGAATGGT 


TCTTCTACAG 


EG327 


AAGACAAAGG 


CGAGAATGAT 


TCTTCTACAG 


Consensus 


AAG-CAAAG- 


CGAGAATG-T 


TCTTCTACAG 



980 

ACGAAGGCGA AGGCTTAGTG AC TGCAAAAG AAGTGATTGA 
ACGAAGGCGA AGGCTTAGTG AC TGCAAAAG AAGTGATTGA 
ACGAAGGCGA AGGCTTAGTG AC TGCAAAAG AAGTGATTGA 
ACGAAGGCGA AGGCTTAGTG ACTGCAAAAG AAGTGATTGA 
ACGAAGGCGA AGGCTTAGTG ACTGCAAAAG AAGTGATTGA 
ACGAAGGCGA AGGCTTAGTG ACTGCAAAAG AAGTGATTGA 
ACGAAGGCGA AGGCTTAGTG ACTGCAAAAG AAGTGATTGA 
ACGAAGGCGA AGGCTTAGTG ACTGCAAAAG AAGTGATTGA 
ACGAAGGCGA AGGCTTAGTG ACTGCAAAAG AAGTGATTGA 
AC AAAGGC GA AGGCTTAGTG ACTGCAAAAG AAGTGATTGA 
AC-AAGGCGA AGGCT TAGTG ACTGCAAAA G AAGTGATTGA 
C5 



HI 5 
BZ10 
BZ198 

P20 

H38 
Z2491 

H41 
EG329 
PMC21 
EG327 
Consensus 



981 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 



AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 



GGAGAATGAA 
GGAGAATGAA 
GGAGAATGAA 
GGAGAATGAA 
GGAGAATGAA 
GGAGAATGAA 
GGAGAATGAA 
GGAGAATGAA 
GGAGAATGAA 
GGAGAATGAA 
GGAGAATGAA 



AACAACAACC 
AACAACAACC 
AACAACAACC 
AACAACAACC 
AACAACAACC 
AACAACAACC 
AACAACAACC 
AACAACAACC 
AACAACAACC 
AACAACAACC 
AACAACAACC 



GCTAATGGTC 
GCTAATGGTC 
GCTAATGGTC 
GCTAATGGTC 
GCTAATGGTC 
GCTAATGGTC 
GCTAATGGTC 
GCTAATGGTC 
GCTAATGGTC 
GCTAATGGTC 
GCTAATGGTC 



AAACAGGTCA 
AAACAGGTCA 
AAACAGGTCA 
AAACAGGTCA 
AAACAGGTCA 
AAACAGGTCA 
AAACAGGTCA 
AAACAGGTCA 
AAACAGGTCA 
AAACAGGTCA 
AAACAGGTCA 



105 0 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 



C5 



HIS 
BZ10 
BZ198 

P20 

H38 
Z2491 

H41 
EG329 
PMC21 
EG327 
Consensus 



1051 

TTTGAAACCG 
TTTGAAACCG 
TTTGAAACCG 
TTTGAAACCG 
TTTGAAACCG 
TTTGAAACCG 
TTTGAAACCG 
TTTGAAACCG 
TTTGAAACCG 
TTTGAAACCG 
TTTGAAACCG 



TTACATCAGG 
TTACATCAGG 
TT AC AT CAGG 
TTACATCAGG 
TTACATCAGG 
TTACATCAGG 
TTACATCAGG 
TTACATCAGG 
TTACATCAGG 
TTACATCAGG 
TTACATCAGG 



CACAAAAGTA 
CACAAAAGTA 
CACAAATGTA 
CACAAAAGTA 
CACAAATGTA 
CACAAATGTA 
CACAAAAGTA 
CACAAATGTA 
CACAAATGTA 
CACAAATGTA 
C AC AAA- GT A 



ACCTTTGCTA 
ACCTTTGCTA 
ACCTTTGCTA 
ACCTTTGCTA 
ACCTTTGCTA 
ACCTTTGCTA 
ACCTTTGCTA 
ACCTTTGCTA 
ACCTTTGCTA 
ACCTTTGCTA 
ACCTTTGCTA 



GTGGTAATGG 
GTGGTAATGG 
GTGGTAAAGG 
GTGGTAATGG 
GTGGTAAAGG 
GTGGTAAAGG 
GTGGTAATGG 
GTGGTAAAGG 
GTGGTAAAGG 
GTGGTAAAGG 
GTGGTAA-GG 



TACAACTGCG 
TACAACTGCG 
TACAACTGCG 
TACAACTGCG 
TACAACTGCG 
TACAACTGCG 
TACAACTGCG 
TACAACTGCG 
TACAACTGCG 
TACAACTGCG 
TACAACTGCG 



1120 
AC TGTAAGTA 
AC TGTAAGTA 
ACTGTAAGTA 
ACT GTAAGTA 
ACTGTAAGTA 
ACTGTAAGTA 
ACTGTAAGTA 
ACTGTAAGTA 
ACTGTAAGTA 
ACTGTAAGTA 
ACTGTAAGTA 



C5 



HIS 
BZ10 
BZ198 

P20 

H38 
Z2491 

H41 
EG329 
PMC21 
EG327 
Consensus 



1121 

AAGAT GATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 



AGGCAACATC 
AGGCAACATC 
AGGCAACATC 
AGGCAACATC 
AGGCAACATC 
AGGCAACATC 
AGGCAACATC 
AGGCAACATC 
AGGCAACATC 
AGGCAACATC 
AGGCAACATC 



ACTGTTAAGT 
ACTGT TAAGT 
ACTGTTAAGT 
ACTGTTAAGT 
ACTGTTAAGT 
ACTGTTATGT 
ACTGTTAAGT 
ACTGTTATGT 
ACTGTTATGT 
ACTGTTATGT 
ACTGTTA-GT 



ATGATGTAAA 
ATGATGTAAA 
ATGATGTAAA 
ATGATGTAAA 
ATGATGTAAA 
ATGATGTAAA 
ATGATGTAAA 
ATGATGTAAA 
ATGATGTAAA 
ATGATGTAAA 
ATGATGTAAA 



TGTCGGCGAT 
TGTCGGCGAT 
TGTCGGCGAT 
TGTCGGCGAT 
TGTCGGCGAT 
TGTCGGCGAT 
TGTCGGCGAT 
TGTCGGCGAT 
TGTCGGCGAT 
TGTCGGCGAT 
TGTCGGCGAT 



GCCCTAAACG 
GCCCTAAACG 
GCCCTAAACG 
GCCCTAAACG 
GCCCTAAACG 
GCCCTAAACG 
GCCCTAAACG 
GCCCTAAACG 
GCCCTAAACG 
GCCCTAAACG 
GCCCTAAACG 



1190 
TCAATCAGCT 
TCAATCAGCT 
TCAATCAGCT 
TCAATCAGCT 
TCAATCAGCT 
TCAATCAGCT 
TCAATCAGCT 
TCAATCAGCT 
TCAATCAGCT 
TCAATCAGCT 
TCAATCAGCT 



C5 
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1191 




HIS 


GCAAAACAGC 


GGTT GGAATT 


B210 


GCAAAACAGC 


GGTT GGAATT 


BZ198 


GCAAAACAGC 


GGTT GGAATT 


P20 


GCAAAACAGC 


GGTT GGAATT 


H38 


GCAAAACAGC 


GGTT GGAATT 


Z2491 


GCAAAACAGC 


GGTT GGAATT 


H41 


GCAAAACAGC 


GGTT GGAATT 


EG329 


GCAAAACAGC 


GGTTGGAATT 


PMC 21 


GCAAAACAGC 


GGTT GGAATT 


EG327 


GCAAAACAGC 


GGTT GGAATT 


Consensus 


GCAAAACAGC 


GGTT GGAATT 



TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 



AGC GGTTGCA 
AGCGGTTGCA 
AGC GGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 



GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 



GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 



1260 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 



C5 



HIS 
BZ10 
BZ198 

P20 

H38 
Z2491 

H41 
EG329 
PMC21 
EG327 
Consensus 



1261 

GTTTCGCCGA 
GTTTCGCCGA 
GTTTCGCCGA 
GTTTCGCCGA 
GTTTCGCCGA 
GTTTCGCCGA 
GTTTCGCCGA 
GTTTCGCCGA 
GTTTCGCCGA 
GTTTCGCCGA 
GTTTCGCCGA 



GCAAGGGAAA 
GCAAGGGAAA 
GCAAGGGAAA 
GCAAGGGAAA 
GCAAGGGAAA 
GCAAGGGAAA 
GCAAGGGAAA 
GCAAGGGAAA 
GCAAGGGAAA 
GCAAGGGAAA 
GCAAGGGAAA 



GATGGATGAA 
GATGGATGAA 
GATGGATGAA 
GATGGATGAA 
GATGGATGAA 
GATGGATGAA 
GATGGATGAA 
GATGGATGAA 
GATGGATGAA 
GATGGATjdAA 
GATGGATGAA 



ACCGTCAACA 
ACCGTCAACA 
ACCGTCAACA 
ACCGTCAACA 
ACCGTCAACA 
ACCGTCAACA 
ACCGTCAACA 
ACCGTCAACA 
ACCGTCAACA 
ACCGTCAACA 
ACCGTCAACA 



TTAATGCCGG 
TTAATGCCGG 
TTAATGCCGG 
TTAATGCCGG 
TTAATGCCGG 
TTAATGCCGG 
TTAATGCCGG 
TTAATGCCGG 
TTAATGCCGG 
TTAATGCCGG 
TTAATGCCGG 



CAACAACATC 
CAACAACATC 
CAACAACATC 
CAACAACATC 
CAACAACATC 
CAACAACATC 
CAACAACATC 
CAACAACATC 
CAACAACATC 
CAACAACATC 
CAACAACATC 



1330 
GAGATTACCC 
GAGATTACCC 
GAGATTACCC 
GAGATTACCC 
GAGATTACCC 
GAGATTAGCC 
GAGATTACCC 
GAGATTACCC 
GAGATTACCC 
GAGATTACCC 
GAGATTA- CC 
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GCAACGGCAA 
GCAACGGCAA 
GCAACGGTAA 
GCAACGGCAA 
GCAACGGTAA 
GCAACGGTAA 
GCAACGGCAA 
GCAACGGTAA 
GCAACGGTAA 
GCAACGGCAA 
GC AACGG-AA 



AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 



ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 



CGATGACCCC 
CGATGACCCC 
CGATGGCGCC 
CGATGACCCC 
CGATGACCCC 
CGATGGCGCC 
CGATGACCCC 
CGATGACCCC 
CGATGACCCC 
CGATGACCCC 
CGATG-C-CC 



GCAATTTTCC 
GCAATTTTCC 
GCAGTTTTCC 
GCAATTTTCC 
GCAGTTTTCC 
GCAGTTTTCC 
GCAATTTTCC 
GCAGTTTTCC 
GCAGTTTTCC 
GCAATTTTCC 
GCA-TTTTCC 



AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
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TCGGCGCGGG 
TCGGCGCGGG 
TCGGTGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGG-GCGGG 
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1401 

GGCGGATGCG 
GGCGGATGCG 
GGCGGATGCG 
GGCGGATGCG 
GGCGGATGCG 
GGCAGATGCG 
GGCGGATGCG 
GGCGGATGCG 
GGCGGATGCG 
GGCGGATGCG 
GGC-GATGCG 



CCCACTTTAA 
CCCACTTTAA 
CCCACTTTGA 
CCCACTTTAA 
CCCACTTTGA 
CCCACTTTAA 
CCCACTTTAA 
CCCACTTTGA 
CCCACTTTGA 
CCCACTTTAA 
CCCACTTT-A 



GCGTGGATGA 
GCGTGGATGA 
GCGT GGATGA 
GCGTGGATGA 
GCGTGGATGA 
GCGTGGATGA 
GCGTGGATGA 
GCGTGGAT . . 
GCGTGGAT. . 
GCGTGGATGA 
GCGTGGAT — 



CGAGGGCGCG 
CGAGGGCGCG 
CGAGGGCGCG 
CGAGGGCGCG 
CAAGGGCGCG 
CGAGGGCGCG 
CGAGGGCGCG 
.GGGGACGCA 
.GGGGACGCA 
CGAGGGCGCG 
GG-CGC- 



TTGAATGTCG 
TTGAATGTCG 
TTGAATGTCG 
TTGAATGTCG 
TTGAATGTCG 
TTGAATGTCG 
TTGAATGTCG 
TTGAATGTCG 
TTGAATGTCG 
TTGAATGTCG 
TTGAATGTCG 



GCAGCAAGGA 
GCAGCAAGGA 
GCAGCAAGGA 
GCAGCAAGGA 
GCAGCAAGGA 
GCAGCAAGGA 
GCAGCAAGGA 
GCAGCAAGAA 
GCAGCAAGAA 
GCAGCAAGGA 
GCAGCAAG-A 



1470 
TGCCAACAAA 
TGCCAACAAA 
TACCAACAAA 
TGCCAACAAA 
TGCCAACAAA 
TGCCAACAAA 
TGCCAACAAA 
GGACAACAAA 
GGACAACAAA 
TGCCAACAAA 
CAACAAA 
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1471 

CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 



TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 



CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 



GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 



GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 



AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCG 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGC- 



1540 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 



1541 

GTGTGGCGCA 
GTGTGGCGCA 
GCGTGGCGCA 
GTGTGGCGCA 
GCGTGGCGCA 
GCGTGGCGCA 
GTGTGGCGCA 
GCGTGGCGCA 
GCGTGGCGCA 
GCGTGGCGCA 
G-GTGGCGCA 



AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 



AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AAC CACATC G 
AACC-CATCG 
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ACAATGTGGA 
ACAATGTGGA 
ACAATGTGGA 
ACAATGTGAA 
ACAATGTGGA 
ACAATGTGGA 
ACAATGTGAA 
ACAATGTGGA 
ACAATGTGGA 
ACAATGTGGA 
ACAATGTG-A 



CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 



CGCGCGGGTA 
CGCGCGGGTA 
CGTGCGGGCA 
CGCGCGGGTA 
CGTGCGGGCA 
CGTGCGGGCA 
CGTGCGGGCA 
CGTGCGGGCA 
CGTGCGGGCA 
CGTGCGGGCA 
CG-GCGGG-A 
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TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
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1611 

GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 



GCAGGTTT GG 
GCAGGTTTGG 
GCAGGTCTAG 
GCAGGTTTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGT-T-G 



CTCAGGCGTA 
CTCAGGCCTA 
TTCAGGCGTA 
CTCAGGCCTA 
TTCAGGCGTA 
TTCAGGCGTA 
TTCAGGCGTA 
TTCAGGCGTA 
TTCAGGCGTA 
TTCAGGCGTA 
-TCAGGC-TA 



TTTGCCCGGC 
TTTGCCCGGC 
TCTGCCCGGC 
TTTGCCCGGC 
TCTGCCCGGC 
TCTGCCCGGC 
TCTGCCCGGC 
TTTGCCCGGC 
TTTGCCCGGC 
TCTGCCCGGC 
T-TGCCCGGC 



AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 



TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 



1680 
CGGCGGTACT 
CGGCGGTACT 
CGGCGACACT 
CGGCGGTACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCG — ACT 
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1 MNKI YRIIWN SALNAWVWS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 

51 ANNETDLTSV GTEKLSFSAN GNKVNITSDT KGLNFAKETA GTNGDTTVHL 

101 NGIGSTLTDT LLNTGATTNV TNDNVTDDEK KRAASVKDVL NAGWNIKGVK 

151 PGTTASDNVD FVRTYDTVEF LSADTKTTTV NVESKDNGKK TEVKIGAKTS 

2 01 VIKEKDGKLV TGKDKGENGS STDEGEGLVT AKEVI DAVNK AGWRMKTTTA 

251 NGQTGQADKF ETVTSGTNVT FASGKGTTAT VSKDDQGNIT VMYDVNVGDA 

301 LNVNQLQNSG WNLDSKAVAG SSGKVISGNV SPSKGKMDET VNINAGNNIE 

351 1TRNGKNIDI ATSMTPQFSS VSLGAGADAP TLSVDGDALN VGSKKDNKPV 

4 01 RITNVAPGVK EGDVTNVAQL KGVAQNLNNR IDNVDGNARA GIAQAIATAG 

451 LVQAYLPGKS MMAIGGGTYR GEAGYAIGYS SISDGGNWII KGTASGNSRG 

501 HFGASASVGY QW* 



FIG. 5A 

1 AT GAAC AAAA TATACCGCAT CATTTGGAAT AGTGCCCTCA ATGCATGGGT 

51 CGTCGTATCC GAGCTCACAC GCAACCACAC CAAACGCGCC TCCGCAACCG 

101 TGAAGACCGC CGTATTGGCG ACTCTGTTGT TTGCAACGGT TCAGGCAAGT 

151 GCTAACAATG AAACAGATCT GACCAGTGTT GGAACT GAAA AATTATCGTT 

2 01 TAGC GCAAAC GGCAATAAAG TCAACATCAC AAGCGACACC AAAGGCTTGA 

251 ATTTTGCGAA AGAAACGGCT GGGACGAACG GCGACACCAC GGTTCATCTG 

301 AACGGTATTG GTTCGACTTT GACCGATACG CTGCTGAATA CCGGAGCGAC 

351 CACAAACGTA ACCAACGACA ACGTTACCGA TGACGAGAAA AAACGTGCGG 

401 CAAGCGT TAA AGAC GTATTA AACGCTGGCT GGAACATTAA AGGCGTTAAA 

' 451 CCCGGTACAA CAGCTTCCGA TAACGT T GAT TTCGTCCGCA CTTACGACAC 

501 AGTCGAGTTC TTGAGCGCAG ATACGAAAAC AACGACTGTT AATGTGGAAA 

551 GCAAAGACAA CGGCAAGAAA AC C GAAGTTA AAATCGGTGC GAAGACTTCT 

601 GTTATTAAAG AAAAAGACGG TAAGTT GGTT ACTGGTAAAG ACAAAG GCGA 

651 GAATGGTTCT TCTACAGACG AAGGCGAAGG CTTAGTGACT GCAAAAGAAG 

701 TGATTGATGC AGTAAACAAG GCTGGTTGGA GAAT GAAAAC AACAACCGCT 

751 AAT GGTCAAA CAGGT CAAGC TGACAAGTTT GAAACCGTTA CATCAGGCAC 

801 AAATGTAACC TTTGCTAGTG GTAAAGGTAC AACTGCGACT GTAAGTAAAG 

851 AT GAT CAAGG CAACATCACT GTTATGTATG AT GTAAAT GT CGGCGATGCC 

901 CTAAACGT CA ATCAGCTGCA AAACAGCGGT TGGAATTTGG ATTC CAAAGC 

951 GGTTGCAGGT TCTTCGGGCA AAGT CAT CAG CGGCAATGTT TCGCCGAGCA 

1001 AGGGAAAGAT GGATGAAACC GTCAACATTA ATGCCGGCAA CAACATCGAG 

1051 ATTACCCGCA ACGGTAAAAA TATCGACATC GCCACTTCGA TGACCCCGCA 

1101 GTTTTCCAGC GTTTCGCTCG GCGCGGGGGC GGATGCGCCC ACTTTGAGCG 

1151 TGGAT GGGGA CGCATT GAAT GTCGGCAGCA AGAAGGACAA CAAACCCGTC 

12 01 CGCATTACCA ATGTCGCCCC GGGCGTTAAA GAGGGGGATG TTACAAACGT 

1251 CGCACAACTT AAAGGCGTGG CGCAAAACTT GAACAACCGC ATCGACAATG 

1301 TGGACGGCAA CGCGCGTGCG GGCATCGCCC AAGCGATTGC AACCGCAGGT 

1351 CTGGTTCAGG CGTATTTGCC CGGCAAGAGT AT GAT GG C GA TCGGCGGCGG 

14 01 CACTTATCGC GGCGAAGCCG GTTACGCCAT CGGCTACTCC AGTATTTCCG 

1451 ACGGCGGAAA TTGGATTATC AAAGGCACGG CTTCCGGCAA TTCGCGCGGC 

1501 CATTTCGGTG CTTCCGCATC TGTCGGTTAT CAGTGGTAA 
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1 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA T L L FAT VQ AN 

51 ATDETGLINV ETEKLSFGAN GKKVNIISDT KGLNFAKETA GTNGDTTVHL 

101 NGIGSTLTDM LLNTGATTNV TNDNVTDDEK KRAASVKDVL NAGWNIKGVK 

151 PGTTASDNVD FVRTYDTVEF LSADTKTTTV NVESKDNGKK TEVKIGAKTS 

201 VIKEKDGKLV TGKGKGENGS STDEGEGLVT AKEVIDAWK AGWRMKTTTA 

251 NGQTGQADKF ETVTSGTKVT FASGNGTTAT VSKDDQGNIT VKYDVNVGDA 

301 LNVNQLQNSG WNLDSKAVAG SSGKVISGNV SPSKGKMDET VNINAGNNIE 

351 ITRNGKNIDI ATSMTPQFSS VSLGAGADAP TLSVDDEGAL NVGSKDANKP 

4 01 VRITNVAPGV KEGDVTNVAQ LKGVAQNLNN RIDNVNGNAR AGIAQAIATA 

451 GLVQAYLPGK SMMAXGGGTY LGEAGYAIGY SSISAGGNWI IKGTAS GNSR 

501 GHFGASASVG YQW* 



FIG. 6A 



1 AT GAACAAAA TATACCGCAT CAT TT GGAAT AGTGCCCTCA ATGCCTGGGT 

51 CGCCGTATCC GAGCTCACAC GCAACCACAC CAAACGCGCC TCCGCAACCG 

101 TGAAGACCGC CGTATTGGCG ACACTGTTGT TTGCAACGGT TCAGGCGAAT 

151 GCTACCGATG AAACAGGCCT GATCAATGTT GAAACTGAAA AATTATCGTT 

201 TGGCGCAAAC GGCAAGAAAG TCAACATCAT AAGC GACACC AAAGGCTTGA 

251 ATTTCGCGAA AGAAACGGCT GGGACGAACG GCGACACCAC GGTTCATCTG 

301 AACGGTATCG GTTCGACTTT GACCGATATG CTGCTGAATA CCGGAGCGAC 

3 51 CACAAACGTA ACCAACGACA ACGTTACCGA TGACGAGAAA AAACGTGCGG 

4 01 CAAGCGTTAA AGACGTATTA AACGCAGGCT GGAACATTAA AGGCGTTAAA 
451 CCCGGTACAA CAGCTTCCGA TAACGTTGAT TTCGTCCGCA CTTACGACAC 

5 01 AGTCGAGTTC TTGAGCGCAG ATACGAAAAC AACGACTGTT AATGTGGAAA 
551 GCAAAGACAA CGGCAAGAAA ACC GAAGTTA AAATCGGTGC GAAGACTTCT 
601 GTTATTAAAG AAAAAGACGG TAAGTTGGTT ACT GGTAAAG GCAAAGGCGA 
651 GAATGGTTCT TCTACAGACG AAGGCGAAGG CTTAGT GACT GCAAAAGAAG 

7 01 T GAT T GAT GC AGTAAACAAG GCTGGTTGGA GAAT GAAAAC AACAACCGCT 
751 AATGGT CAAA CAGGTCAAGC TGACAAGTTT GAAACCGTTA CATCAGGCAC 
801 AAAAGTAACC TTTGCTAGTG GTAAT GGT AC AACT GCGACT GTAAGTAAAG 

8 51 AT GAT CAAG G CAACATCACT GTTAAGTATG ATGTAAATGT CGGCGATGCC 
901 CTAAACGTCA ATCAGCTGCA AAACAGCGGT TGGAATTTGG ATT C CAAAGC 
951 GGTTGCAGGT TCTTCGGGCA AAGT CAT CAG CGGCAATGTT TCGCCGAGCA 

1001 AGGGAAAGAT GGATGAAACC GTCAACATTA AT GC C GGCAA CAACATCGAG 

1051 ATTACCCGCA ACGGCAAAAA TATCGACATC GCCACTTCGA TGACCCCGCA 

1101 ATTTTCCAGC GTTTCGCTCG GCGCGGGGGC GGATGCGCCC ACTTTAAGCG 

1151 TGGATGACGA GGGCGCGTTG AATGTCGGCA GCAAGGATGC CAACAAACCC 

1201 GTCCGCATTA CCAATGTCGC CCCGGGCGTT AAAGAGGGGG AT GTTACAAA 

1251 CGTCGCGCAA CTTAAAGGTG TGGCGCAAAA CTTGAACAAC CGCATCGACA 

13 01 ATGTGAACGG CAACGCGCGT GC GGGCATCG CCCAAGCGAT TGCAACCGCA 
1351 GGTCTGGTTC AGGCGTATCT GCCCGGCAAG AGTATGATGG CGATCGGCGG 

14 01 CGGCACTTAT CTCGGCGAAG CCGGTTATGC CATCGGCT&C TCAAGCATTT 
1451 CCGCCGGCGG AAATT GGATT AT CAAAGGCA CGGCTTCCGG CAATTCGCGC 
1501 GGCCATTTCG GTGCTTCCGC ATCTGTCGGT TATCAGTGGT AA 
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1 MNKIYRIIWN SALNAWVWS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 

51 ANNVDFVRTY DTVEFLJSADT KTTTVNVESK DNGKKTEVKI GAKTSVIKEK 

101 DGKLVTGKDK GENGSSTDEG EGLVTAKEVI DAVNKAGWRM KTTTANGQTG 

151 QADKFETVTS GTNVTFASGK GTTATVS KDD QGNITVMYDV NVGDALNVNQ 

201 LQNSGWNLDS KAVAGSSGKV ISGNVSPSKG KMDETVNINA GNNIEITRNG 

251 KNIDIATSMT PQFSSVSLGA GADAPTLSVD GDALNVGSKK DNKPVRITNV 

301 APGVKEGDVT NVAQLKGVAQ NLNNRIDNVD GNARAGIAQA IATAGLVQAY 

351 LPGKSMMAIG GGTYRGEAGY AIGYSSISDG GNWIIKGTAS GNSRGHFGAS 

4 01 ASVGYQW* 



FIG. 7 A 



1 


ATGAACAAAA 


TATACCGCAT 


51 


CGTCGTATCC 


GAGCT CACAC 


101 


T GAAGACCGC 


CGTATTGGCG 


151 


GCTAACAACG 


TTGATTTCGT 


201 


CGCAGATACG 


AAAACAACGA 


251 


AGAAAACCGA 


AGTTAAAATC 


301 


GACGGTAAGT 


TGGTTACTGG 


351 


AGACGAAGGC 


GAAGGCTTAG 


401 


ACAAGGCTGG 


TTGGAGAATG 


451 


CAAGCTGACA 


AGTTT GAAAC 


501 


TAGTGGTAAA 


GGTACAACTG 


551 


TCACTGTTAT 


GTATGATGTA 


601 


CTGCAAAACA 


GCGGTTGGAA 


651 


GGGCAAAGTC 


ATCAGCGGCA 


701 


AAACCGT CAA 


CATTAATGCC 


751 


AAAAATATCG 


ACATCGCCAC 


801 


GCTCGGCGCG 


GGGGCGGATG 


851 


TGAATGTCGG 


CAGCAAGAAG 


901 


GCCCCGGGCG 


TTAAAGAGGG 


951 


CGTGGCGCAA 


AACTT GAACA 


1001 


GTGCGGGCAT 


CGCCCAAGCG 


1051 


TTGCCCGGCA 


AGAGT AT GAT 


1101 


AGCCGGTTAC 


GCCATCGGCT 


1151 


TTAT CAAAGG 


CACGGCTTCC 


1201 


GCATCTGTCG 


GTTATCAGTG 



CATTTGGAAT AGTGCCCTCA ATGCATGGGT 
GCAACCACAC CAAACGCGCC TCCGCAACCG 
ACTCTGTTGT TTGCAACGGT TCAGGCAAGT 
CCGCACTTAC GACACAGTCG AGTTCTTGAG 
CTGTTAATGT G GAAAGC AAA GACAACGGCA 
GGTGCGAAGA CTTCTGTTAT TAAAGAAAAA 
TAAAGACAAA GGCGAGAATG GTTCTTCTAC 
TGACTGCAAA AGAAGTGATT GAT GCAGTAA 
AAAACAACAA CCGCTAATGG TCAAACAGGT 
CGTTACATCA GGCACAAATG TAACCTTTGC 
CGACT GTAAG T AAAGAT GAT CAAGGCAACA 
AATGTCGGCG ATGCCCTAAA CGTCAATCAG 
TTTGGATTCC AAAGCGGTTG CAGGTTCTTC 
ATGTTTCGCC GAGCAAGGGA AAGATGGATG 
GGCAACAACA T C GAGATT AC CCGCAACGGT 
TTCGATGACC CCGCAGTTTT CCAGCGTTTC 
CGCCCACTTT GAGCGTGGAT GGGGACGCAT 
GACAACAAAC CCGTCCGCAT TACCAATGTC 
G GAT GTTACA AACGTCGCAC AACTTAAAGG 
ACCGCATCGA CAATGTGGAC GGCAACGCGC 
ATTGCAACCG CAGGTCTGGT TCAGGCGTAT 
GGCGATCGGC GGCGGCACTT ATCGCGGCGA 
ACTCCAGTAT TTCCGACGGC GGAAATTGGA 
GGCAATTCGC GCGGCCATTT CGGTGCTTCC 
GTAA 
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1 MNKIYRIIWN 

51 ANRAASVKDV 

101 VNVESKDNGK 

151 TAKEVIDAVN 

201 TVSKDDQGNI 

2 51 VSPSKGKMDE 

3 01 PTLSVDGDAL 
351 RIDNVDGNAR 
401 SSISDGGNWI 



SALNAWWVS 
LNAGWNIKGV 
KTEVKIGAKT 
KAGWRMKTTT 
TVMYDVNVGD 
TVNINAGNNI 
NVGSKKDNKP 
AG I AQAI ATA 
IKGTASGNSR 



ELTRNHTKRA 
KPGTTASDNV 
SVIKEKDGKL 
ANGQTGQADK 
ALNVNQLQNS 
EITRNGKNID 
VRITNVAPGV 
GLVQAYLPGK 
GHFGASASVG 



SATVKTAVLA 
DFVRTYDTVE 
VTGKDKGENG 
FETVTSGTNV 
GWNLDSKAVA 
IATSMTPQFS 
KEGDVTNVAQ 
SMMAIGGGTY 
YQW* 



TLLFATVQAS 
FLSADTKTTT 
SSTDEGEGLV 
TFASGKGTTA 
GSSGKVISGN 
SVS LGAGADA 
LKGVAQNLNN 
RGEAGYAIGY 



FIG. 8A 



1 AT GAACAAAA TATACCGCAT 

51 CGTCGTATCC GAGCTCACAC 

101 TGAAGACCGC CGTATTGGCG 

151 GCTAACCGTG CGGCAAGCGT 

201 TAAAGGCGTT AAACCCGGTA 

251 GCACTTACGA CACAGTCGAG 

301 GTTAATGTGG AAAGCAAAGA 

351 TGCGAAGACT TCTGTTATTA 

401 AAGACAAAGG CGAGAATGGT 

451 ACT GCAAAAG AAGTGATTGA 

501 AACAACAACC GCTAATGGTC 

551 TTACATCAGG CACAAATGTA 

601 ACT GT AAGT A AAGATGATCA 

651 TGTCGGCGAT GCCCTAAACG 

701 TGGATTCCAA AGCGGTT GCA 

751 GTTTCGCCGA GCAAGGGAAA 

801 CAACAACATC GAGATTACCC 

851 CGATGACCCC GCAGTTTTCC 

901 CCCACTTTGA GCGTGGATGG 

951 CAACAAACCC GTCCGCATTA 

1001 AT GTTACAAA CGTCGCACAA 

1051 CGCAT CGACA ATGTGGACGG 

1101 TGCAACCGCA GGTCTGGTTC 

1151 CGATCGGCGG CGGCACTTAT 

1201 TCCAGTATTT CCGACGGCGG 

1251 CAATT CGCGC GGCCATTTCG 

1301 AA 



CATTTGGAAT AGTGCCCTCA ATGCATGGGT 
GCAACCACAC CAAACGCGCC TCCGCAACCG 
ACTCTGTTGT TTGCAACGGT TCAGGCAAGT 
TAAAGACGTA TTAAACGCTG GCTGGAACAT 
CAACAGCTT C CGATAACGTT GATTTCGTCC 
TTCTTGAGCG CAGATACGAA AACAACGACT 
CAACGGCAAG AAAACCGAAG TTAAAA.T CGG 
AAGAAAAAGA CGGTAAGTTG GTTACT G GTA 
TCTTCTACAG ACGAAGGCGA AGGCTTAGTG 
TGCAGTAAAC AAGGCTGGTT GGAGAATGAA 
AAACAGGTCA AGCTGACAAG TTTGAAACCG 
ACCTTTGCTA GTGGTAAAGG TACAACTGCG 
AGGCAACATC ACT GT TATGT ATGATGTAAA 
TCAATCAGCT GCAAAACAGC GGTTGGAATT 
GGTTCTTCGG GCAAAGT CAT CAGCGGCAAT 
G AT GGATGAA ACCGTCAACA TTAATGCCGG 
GCAACGGTAA AAATAT CGAC ATCGCCACTT 
AGCGTTTCGC TCGGCGCGGG GGCGGATGCG 
GGACGCATTG AATGTCGGCA GCAAGAAGGA 
CCAATGTCGC CCCGGGCGTT AAAGAGGGGG 
CTTAAAGGCG TGGCGCAAAA CTTGAACAAC 
CAACGCGCGT GCGGGCATCG CCCAAGCGAT 
AGGCGTATTT GCCCGGCAAG AGTATGATGG 
CGCGGCGAAG CCGGTTACGC CAT CGGCTAC 
AAATTGGATT AT CAAAGGCA CGGCTTCCGG 
GTGCTTCCGC ATCTGTCGGT TATCAGTGGT 
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1 MNKIYRIIWN SALNAWVWS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 

51 ANTLKAGDNL KIKQFTYSLK KDLTDLTSVG -TEKLSFSANG NKVNITSDTK 

101 GLN FAKETAG TNGDTTVHLN GIGSTLTDRA ASVKDVLNAG WNI KGVKNVD 

151 FVRTYDTVEF LSADTKTTTV NVESKDNGKK TEVKIGAKTS VIKEKDGKLV 

2 01 TGKDKGENGS STDEGEGLVT AKEVIDAVNK AGWRMKTTTA NGQTGQADKF 

251 ETVTSGTNVT FASGKGTTAT VSKDDQGNIT VMYDVNVGDA LNVNQLQNSG 

301 WNLDSKAVAG SSGKVISGNV SPSKGKMDET VNINAGNNIE ITRNGKNIDI 

351" ATSMTPQFSS VSLGAGADAP TLSVDGDALN VGSKKDNKPV RITNVAPGVK 

401 EGDVTNVAQL KGVAQNLNNR I DNVDGNARA GIAQAIATAG LVQAYLPGKS 

451 MMAIGGGTYR GEAGYAIGYS SISDGGNWII KGTASGNSRG HFGASASVGY 

501 QW* 



FIG. 9 A 



1 ATGAACAAAA TATACCGCAT CATTTGGAAT AGTGCCCTCA ATGCATGGGT 

51 CGTCGTATCC GAGCTCACAC GCAACCACAC CAAACGCGCC TCCGCAACCG 

101 TGAAGACCGC CGTATTGGCG ACTCTGTTGT TTGCAACGGT TCAGGCAAGT 

151 GCTAACACCC TCAAAGCCGG CGACAACCTG AAAATCAAAC AATT CACCTA 

201 CTCGCTGAAA AAAGACCTCA CAGATCTGAC CAGTGTTGGA ACT GAAAAAT 

251 TAT CGTTTAG CGCAAACGGC AATAAAGTCA ACATCACAAG CGACAC CAAA 

301 GGCTTGAATT TTGCGAAAGA AACGGCTGGG ACGAACGGCG ACACCACGGT 

351 TCATCTGAAC GGTATTGGTT CGACTTT GAC CGATCGTGCG GCAAGCGTTA 

401 AAGACGTATT AAACGCTGGC T GGAACAT T A AAGGCGTTAA AAACGTTGAT 

451 TTCGTCCGCA CTTACGACAC AGTCGAGTTC TTGAGCGCAG ATACGAAAAC 

501 AACGACTGTT AATGT GGAAA GCAAAGACAA CGGCAAGAAA ACCGAAGTTA 

551 AAATCGGTGC GAAGACTTCT GTTATTAAAG AAAAAGACGG TAAGTTGGTT 

601 ACT GGTAAAG ACAAAGGCGA GAATGGTTCT TCTACAGACG AAGGCGAAGG 

651 CTTAGT GACT GCAAAAGAAG TGATTGATGC AGTAAACAAG GCTGGTTGGA 

701 GAATGAAAAC AACAACCGCT AAT GGT CAAA CAGGT CAAGC TGACAAGTTT 

751 GAAACCGTTA CATCAGGCAC AAATGTAACC TTTGCTAGTG GTAAAGGTAC 

8 01 AACTGCGACT GTAAGTAAAG AT GAT CAAGG CAACAT CACT GTTATGTATG 

851 ATGTAAATGT CGGCGATGCC CTAAACGTCA AT CAGCT GCA AAACAGCGGT 

901 TGGAATTTGG ATTCCAAAGC GGTTGCAGGT TCTTCGGGCA AAGT CAT CAG 

951 CGGCAAT GTT TCGCCGAGCA AGGGAAAGAT GGAT GAAACC GT CAACATT A 

1001 ATGCCGGCAA CAACAT CGAG ATTACCCGCA AC GGTAAAAA TATCGACATC 

1051 GCCACTTCGA TGACCCCGCA GTTTTCCAGC GTTTCGCTCG GCGCGGGGGC 

1101 GGATGCGCCC ACTTTGAGCG TGGATGGGGA CGCATTGAAT GTCGGCAGCA 

1151 AGAAGGACAA CAAACCCGTC CGCATTACCA ATGTCGCCCC -GGGCGTTAAA 

1201 GAGGGGGATG T TACAAAC GT CGCACAACTT AAAGGCGTGG CGCAAAACTT 

1251 GAACAACCGC ATCGACAATG TGGACGGCAA CGCGCGTGCG GGCATCGCCC 

1301 AAGCGATTGC AACCGCAGGT CTGGTTCAGG CGTATTTGCC CGGCAAGAGT 

1351 AT GAT GGCGA TCGGCGGCGG CACTTATCGC GGCGAAGCCG GTTACGCCAT 

14 01 CGGCTACTCC AGTATTTCCG ACGGCGGAAA TTGGATTATC AAAGGCACGG 

14 51 CTTCCGGCAA TTCGCGCGGC CATTTCGGTG CTTCCGCATC TGTCGGTTAT 

1501 CAGTGGTAA 
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1 50 

H41 MNKIYRIXWN SALNAWVAVS ELTRNHTKRA SATVKTAVL A TLLFATVOAN 

PMC 21 MNKI YRIIWN SALNAWVVVS DLT RNHTKRA SATVNTAVLA TLLFATVQAS 

H4IStudel MNKI YRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA TLLFATVQAN 

PMC21Bgldel MNKI YRIIWN SALNAVJVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 

PMC21C1C5 MNKIYRIIWK SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 

CI 



51 100 
H41 ATDSD. .EEE ELESVQRS.V VGSIQASMEG SVELET ... I SLSM TNDSKE 
PMC 21 ANNEEQEEYL YLHPVQRTVA VLIVNSDKEG AGEKEKVEEN SDWAVYFNEK 

H41Studel ATDE 

PMC21Bgldel ANNE 

PMC21C1C5 AN 

VI 



101 150 
H41 FVDPYIWXL KAGDNLKIKQ N.TNENTNAS SFTYSL KKDL TGLINVETEK 

PMC21 GVLTAREITL KAGDNLKI KQ NGTN FTYS LKKDL T DLTSVGTEK 

H41Studel TGLINVETEK 

PMC21Bgldel TDLTSVGTEK 

PMC21C1C5 

VI C2 V2 C3 



151 200 
H41 LSFGANGKKV NIISDTKGLN FAKETAGTNG DTTVHLNGIG STLTDMLLNT 
PMC21 LSFSAHGNKV NITSDTKGLN F AKE TAG TNG DTTVHL NGIG STLTDTLLNT 
H4lStudel LSFGANGKKV NIISDTKGLN FAKETAGTNG DTTVHLNGIG STLTDMLLNT 
PMC21Bgldel LSFSANGNKV NITSDTKGLN FAKETAGTNG DTTVHLNGIG STLTDTLLNT 

PMC21C1C5 

C3 V3 



201 250 
H41 GATTNVTNDN VTDDEKK RAA SVKDVLNAGW NIKGVKPGTT ASDNVDFVRT 
PMC 21 GATTNVTNDK VTDDBKKRAA^S VKDVLNAGW NIKG VKPGTT A SDNVDFVRT 
H41Studel GATTNVTNDK VTDDEKKRAA SVKDVLNAGW NIKGVKPGTT ASDNVDFVRT 
PMC21Bgldel GATTNVTNDN VTDDEKKRAA SVKDVLNAGW NIKGVKPGTT ASDNVDFVRT 

PMC21C1C5 NVDFVRT 

V3 C4 V4 C5 



251 300 
H41 YDTVEFLSA D TKTTTVNVES KDNGKKTEVK IGAKTSVIKE KDGKLVTGKG 
PMC 21 YDTVEFLSAD TKTTTVNVES KDNGKKTEVK IGAKTSVI KE KDGKLVTGKD 
H4lStudel YDTVEFLSAD TKTTTVNVES KDNGKKTEVK IGAKTSVIKE KDGKLVTGKG 
PMC21Bgldel YDTVEFLSAD TKTTTVNVES KDNGKKTEVK IGAKTSVIKE KDGKLVTGKD 
PMC21C1C5 YDTVEFLSAD TKTTTVNVES KDNGKKTEVK IGAKTSVIKE KDGKLVTGKD 

C5 



301 350 

H41 KGENGSSTDE GEGLVTAKEV I DAVNKAGWR MKT T TA N GOT GOADKFETVT 

PMC 21 KGENGSSTDE GEGLV TAKEV I DAVNKAGWR M KTTTANGOT GOADKFETVT 

H41Studel KGENGSSTCE GEGLVTAKEV I DAVNKAGWR MKTTTANGQT GQADKFETVT 

PMC21Bgldel KGENGSSTDI GEGLVTAKEV I DAVNKAGWR MKTTTANGQT GQADKFETVT 

PMC21C1C5 KGENGSSTCE GEGLVTAKEV I DAVNKAGWR MKTTTANGQT GQADKFETVT 

C5 



351 400 
H41 SGTKVTFA SG NGTTATVSKD D QGNITVKYD VNVGDAL NVN OLQNSGWNLD 
PMC21 SGTNVTFAS3 KGTTA TVSKD DQGNITVMYD VNVGDALNVN OLQ NSGWNLD 
H41Studel SGTKVTFASG NGTTATVSKD DQGNITVKYD VNVGDALNVN QLQNSGWNLD 
PMC21Bgldel SGTNVTFASC- KGTTATVSKD DQGNITVMYD VNVGDALNVN QLQNSGWNLD 
PMC21C1C5 SGTNVTFASG KGTTATVSKD DQGNITVMYD VNVGDALNVN QLQNSGWNLD 

C5 
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401 450 
H41 SKAVAGSSGK VISGNVSPSK GK MDETVNIN AGNNIE ITRN GKNIDIATSM 
PMC 21 SKAVAGSSGK VISGN VSPSK GKMDETVMIN AGNNIEIT RN GKNIDIATSM 
H41Studel SKAVAGSSGK VISGNVSPSK GKMDETVNIN AGNNIEITRN GKNIDIATSM 
PMC21Bgldel SKAVAGSSGK VISGNVSPSK GKMDETVNIN AGNNIEITRN GKNIDIATSM 
PMC21C1C5 SKAVAGSSGK VISGNVSPSK GKMDETVNIN AGNNIEITRN GKNIDIATSM 

C5 



451 500 

H41 TPOFSSVSLG AGADAPTLSV DDEGALNVGS KDANKPVRIT NVAPGVKEGD 

PMC 21 TPOFSSVSLG AGADAPTLSV DG .DALNVGS KKDNKPVR IT NVAPGVKEGD 

H41Studel TPQFSSVSLG AGADAPTLSV DDEGALNVGS KDANKPVRIT NVAPGVKEGD 

PMC21Bgldel TPQFSSVSLG AGADAPTLSV DG.DALNVGS KKDNKPVRIT NVAPGVKEGD 

PMC21C1C5 TPQFSSVSLG AGADAPTLSV DG.DALNVGS KKDNKPVRIT NVAPGVKEGD 

C5 



501 550 
H41 VTNVAOLKGV AONLNNRIDN VNGNARAGIA OAIATAGLVO AYLPGKSM MA 
PMC21 VTNVAOLK GV AONLNNRIDN VDGNARAGIA QAIATAGLVQ AYLPGKSMMA 
H41Studel VTNVAQLKGV AQNLNNRIDN VNGNARAGIA QAIATAGLVQ AYLPGKSMMA 
PMC21Bgldel VTNVAQLKGV AQNLNNRIDN VDGNARAGIA QAIATAGLVQ AYLPGKSMMA 
PMC21C1C5 VTNVAQLKGV AQNLNNRIDN VDGNARAGIA QAIATAGLVQ AYLPGKSMMA 

C5 



551 600 

H41 IGGGTYLGE A GYAIGY SSIS AGGN WIIKGT A SGNSRGHFG ASASVGYQW . 

PMC 21 I GGGT YRGEA GYAIGYS SIS DGGNWTIKGT ASGNSRGHFG ASASVGYQW. 

H41Studel IGGGTYLGEA GYAIGYSSIS AGGNWIIKGT ASGNSRGHFG ASASVGYQW. 

PMC21Bgldel I GGGT YRGEA GYAIGYSSIS DGGNWIIKGT ASGNSRGHFG ASASVGYQW. 

PMC21C1C5 I GGGT YRGEA GYAIGYSSIS DGGNWIIKGT ASGNSRGHFG ASASVGYQW. 

C5 
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52 


NNEEQEEYL 


YLHPVQRTVA 


VLIVNSDKEG 


AGEKEKVEEN 


S DWAVYFNF K 

i—> 1— ' v v V X X XM X_j x\. 


101 


GVLTAREITL 


KAGDNLKIKQ 


NGTNFTYSLK 


KDLTDLTSVG 


TEKLSFSAHG 


151 


NKVNITSDTK 


GLN FAKE TAG 


TNGDTTVHLN 


GIGSTLTDTL 

VJ -J- V_J X XJ X X XJ 


T.NTGATTKfVT 

XJ IN ± \JJri. J. x LX V x 


201 


NDNVTDDEKK 


RAASVKDVLN 


AGWNIKGVKP 


GTTASDNVDF 

X X ilU LJ L\ V i-J X 


VRTYDTVF FT, 

V x\ 1 1JJ1 V SZj x xj 


251 


SADTKTTTVN 


VESKDNGKKT 


EVKIGAKTSV 


IKEKDGKLVT 


GKDKGENGSS 


301 


TDEGEGLVTA 


KEVI DAVNKA 


GWRMKTTTAN 


GQTGQADKFE 


TVTSGTNVTF 


351 


ASGKGTTATV 


SKDDQGNITV 


MYDVNVGDAL 


NVNQLQNSGW 


NLDSKAVAGS 


401 


SGKVISGNVS 


PSKGKMDETV 


NINAGNNIEI 


TRNGKNIDIA 


TSMTPQFSSV 


451 


SLGAGADAPT 


LSVDGDALNV 


GSKKDNKPVR 


ITNVAPGVKE 


GDVTNVAQLK 


501 


GVAQNLNNRI 


DNVDGNARAG 


IAQAIATAGL 


VQAYLPGKSM 


MAIGGGTYRG 


551 


EAGYAIGYSS 


ISDGGNWIIK 


GTASGNSRGH 


FGASASVGYQ 


W* 



FIG. 14A 



52 


TDEDEEEEL 


ESVQRSVVGS 


IQASMEGSVE 


LETISLSMTN 


DSKEFVDPYI 


101 


VVTLKAGDNL 


KIKQNTNENT 


NASSFTYSLK 


KDLTGLINVE 


TEKLSFGANG 


151 


KKVNIISDTK 


GLN FAKE TAG 


TNGDTTVHLN 


GIGSTLTDML 


LNTGATTNVT 


201 


NDNVTDDEKK 


RAASVKDVLN 


AGWNIKGVKP 


GTTASDNVDF 


VRTYDTVEFL 


251 


SADTKTTTVN 


VESKDNGKKT 


EVKIGAKTSV 


IKEKDGKLVT 


GKGKGENGSS 


301 


TDEGEGLVTA 


KEVI DAVNKA' 


GWRMKTTTAN 


GQTGQADKFE 


TVTSGTKVTF 


351 


ASGNGTTATV 


SKDDQGNITV 


KYDVNVGDAL 


NVNQLQNSGW 


NLDSKAVAGS 


401 


SGKVISGNVS 


PSKGKMDETV 


NINAGNNIEI 


TRNGKNIDIA 


TSMTPQFSSV 


451 


SLGAGADAPT 


LSVDDEGALN 


VGSKDANKPV 


RITNVAPGVK 


EGDVTNVAQL 


501 


KGVAQNLNNR 


I DNVNGNARA 


GIAQAIATAG 


LVQAYLPGKS 


MMAIGGGTYL 


551 


GEAGYAIGYS 


SISAGGNWII 


KGTASGNSRG 


HFGASASVGY 


QW* 
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52 


NNETDLTSV 


GTEKLSFSAN 


GNKVNITSDT 


KGLNFAKETA 


GTNGDTTVHL 


101 


NGIGSTLTDT 


LLNTGATTNV 


TNDNVTDDEK 


KRAASVKDVL 


NAGWNIKGVK 


151 


PGTTASDNVD 


FVRTYDTVEF 


LSADTKTTTV 


NVESKDNGKK 


TEVKIGAKTS 


201 


VIKEKDGKLV 


TGKDKGENGS 


STDEGEGLVT 


AKEVIDAVNK 


AGWRMKTTTA 


251 


NGQTGQADKF 


ETVTSGTNVT 


FASGKGTTAT 


VSKDDQGNIT 


VMYDVNVGDA 


301 


LNVNQLQNSG 


WNLDSKAVAG 


SSGKVISGNV 


SPSKGKMDET 


VNINAGNNIE 


351 


ITRNGKNIDI 


ATSMTPQFSS 


VSLGAGADAP 


TLSVDGDALN 


VGSKKDNKPV 


401 


RITNVAPGVK 


EGDVTNVAQL 


KGVAQNLNNR 


I DNVDGNARA 


GIAQAIATAG 


451 


LVQAYLPGKS 


MMAIGGGTYR 


GEAGYAIGYS 


SISDGGNWII 


KGTASGNSRG 


501 


HFGASASVGY 


QW* 









FIG. 14C 



52 TDETGLINV ETEKLSFGAN GKKVNIISDT KGLNFAKETA GTNGDTTVHL 

101 NGIGSTLTDM LLNTGATTNV TNDNVTDDEK KRAASVKDVL NAGWNIKGVK 

151 PGTTASDNVD FVRTYDTVEF LSADTKTTTV NVESKDNGKK TEVKIGAKTS 

201 VIKEKDGKLV TGKGKGENGS STDEGEGLVT AKEVIDAVNK AGWRMKTTTA 

251 NGQTGQADKF ETVTSGTKVT FASGNGTTAT VSKDDQGNIT VKYDVNVGDA 

301 LNVNQLQNSG WNLDSKAVAG SSGKVISGNV SPSKGKMDET VNINAGNNIE 

351 ITRNGKNIDI ATSMTPQFSS VSLGAGADAP TLSVDDEGAL NVGSKDANKP 

401 VRITNVAPGV KEGDVTNVAQ LKGVAQNLNN RIDNVNGNAR AGIAQAIATA 

451 GLVQAYLPGK SMMAIGGGTY LGEAGYAIGY SSISAGGNWI IKGTASGNSR 

501 GHFGASASVG YQW* 
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52 NNVDFVRTY DTVEFLSADT KTTTVNVESK DNGKKTEVKI GAKTSVIKEK 

101 DGKLVTGKDK GENGSSTDEG EGLVTAKEVI DAVNKAGWRM KTTTANGQTG 

151 QADKFETVTS GTNVTFASGK GTTATVSKDD QGNITVMYDV NVGDALNVNQ 

201 LQNSGWNLDS KAVAGSSGKV ISGNVSPSKG KMDETVNINA GNNIEITRNG 

251 KNIDIATSMT PQFSSVSLGA GADAPTLSVD GDALNVGSKK DNKPVRITNV 

301 APGVKEGDVT NVAQLKGVAQ NLNNRIDNVD GNARAGIAQA IATAGLVQAY 

351 LPGKSMMAIG GGTYRGEAGY AIGYSSISDG GNWIIKGTAS GNSRGHFGAS 

4 01 ASVGYQW* 

FIG. 14E 



52 


NRAASVKDV 


LNAGWNIKGV 


KPGTTASDNV 


DFVRTYDTVE 


FLSADTKTTT 


101 


VNVESKDNGK 


KTEVKIGAKT 


SVIKEKDGKL 


VTGKDKGENG 


SSTDEGEGLV 


151 


TAKEVIDAVN 


KAGWRMKTTT 


ANGQTGQADK 


FETVTSGTNV 


TFASGKGTTA 


201 


TVSKDDQGNI 


TVMYDVNVGD 


ALNVNQLQNS 


GWNLDSKAVA 


GSSGKVISGN 


251 


VSPSKGKMDE 


TVNINAGNNI 


EITRNGKNID 


IATSMTPQFS 


SVSLGAGADA 


301 


PTLSVDGDAL 


NVGSKKDNKP 


VRITNVAPGV 


KEGDVTNVAQ 


LKGVAQNLNN 


351 


RIDNVDGNAR 


AGIAQAIATA 


GLVQAYLPGK 


SMMAIGGGTY 


RGEAGYAIGY 


401 


SSISDGGNWI 


IKGTASGNSR 


GHFGASASVG 
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FIG. 14F 

50 SANTLKAGDNL KIKQFTYSLK KDLTDLTSVG TEKLSFSANG NKVNITSDTK 

101 GLN FAKE TAG TNGDTTVHLN GIGSTLTDRA ASVKDVLNAG WNIKGVKNVD 

151 FVRTYDTVEF LSADTKTTTV NVESKDNGKK TEVKIGAKTS VIKEKDGKLV 

201 TGKDKGENGS STDEGEGLVT AKE VI DAVNK AGWRMKTTTA NGQTGQADKF 

251 ETVTSGTNVT FASGKGTTAT VSKDDQGNIT VMYDVNVGDA LNVNQLQNSG 

301 WNLDSKAVAG SSGKVISGNV SPSKGKMDET VNINAGNNIE ITRNGKNIDI 

351 ATSMTPQFSS VSLGAGADAP TLSVDGDALN VGSKKDNKPV RITNVAPGVK 

4 01 EGDVTNVAQL KGVAQNLNNR IDNVDGNARA GIAQAIATAG LVQAYLPGKS 

451 MMAIGGGTYR GEAGYAIGYS SISDGGNWII KGTASGNSRG HFGASASVGY 

501 QW* 



FIG. 14G 



